Ppdp-mlt: K−anonymity Privacy Preservation for Publishing Search Engine Logs
نویسنده
چکیده
In this paper we investigate the problem of protecting privacy for publishing search engine logs. Search engines play a crucial role in the navigation through the vastness of the Web. Privacy-preserving data publishing (PPDP) provides methods and tools for publishing useful information while preserving data privacy. Recently, PPDP has received considerable attention in research communities, and many approaches have been proposed for different data publishing scenarios. In this paper we study privacy preservation for the publication of search engine query logs. Consider an issue that even after removing all personal characteristics of the searcher, which can serve as links to his identity, the publication of such data, is still subject to privacy attacks from adversaries who have partial knowledge about the set. Our experimental results show that the query log can be appropriately anonymized against the specific attack, while retaining a significant volume of useful data. In this paper we study about problem in search logs and why the log is not secure and how to make log secure using data mining algorithm and techniques like Generalization, Suppression and Quasi identifier.
منابع مشابه
Layered Approach for Personalized Search Engine Logs Privacy Preserving
In this paper we examine the problem of defending privacy for publishing search engine logs. Search engines play a vital role in the navigation through the enormity of the Web. Privacy-preserving data publishing (PPDP) provides techniques and tools for publishing helpful information while preserving data privacy. Recently, PPDP has received significant attention in research communities, and sev...
متن کاملPublishing Search Logs - A Comparative Study of Privacy Guarantees
Search engine companies collect the “database of intentions”, the histories of their users’ search queries. These search logs are a gold mine for researchers. Search engine companies, however, are wary of publishing search logs in order not to disclose sensitive information. In this paper we analyze algorithms for publishing frequent keywords, queries and clicks of a search log. We first show h...
متن کاملPrivacy in Search Logs
Search engine companies collect the “database of intentions”, the histories of their users’ search queries. These search logs are a gold mine for researchers. Search engine companies, however, are wary of publishing search logs in order not to disclose sensitive information. In this paper we analyze algorithms to publish frequent keywords, queries and clicks of a search log. How do their formal...
متن کاملPrivacy preserving data publishing: Review
Privacy preserving data publishing (PPDP) methods a new class of privacy preserving data mining (PPDM) technology, has been developed by the research community working on security and knowledge discovery. It is common to share data between two organizations in many application areas. When data are to be shared between parties, there could be some sensitive patterns which should not be disclosed...
متن کاملSemantic microaggregation for the anonymization of query logs using the open directory project
Web search engines gather information from the queries performed by the user in the form of query logs. These logs are extremely useful for research, marketing, or profiling, but at the same time they are a great threat to the user’s privacy. We provide a novel approach to anonymize query logs so they ensure user k-anonymity, by extending a common method used in statistical disclosure control: ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012